0000000000000000000000000000000000000000 0f6158aaf7269bd7f8cc64c1f78c27d616758ce1 liutao.0220 <liutao.0220@bytedance.com> 1743665108 +0800	clone: from https://github.com/om-ai-lab/VLM-R1.git
0f6158aaf7269bd7f8cc64c1f78c27d616758ce1 976ea6d39dfe7731554f0d648f118f4f9cd97be6 Leon022 <leonliu022@163.com> 1744529624 +0800	commit: first commit
976ea6d39dfe7731554f0d648f118f4f9cd97be6 51ce572f18ad7f4e33bba0577f334fe52583a0d6 liutao.0220 <liutao.0220@bytedance.com> 1745312594 +0800	commit: add think dpo loss
51ce572f18ad7f4e33bba0577f334fe52583a0d6 dd44c0451d3c5b91c4cd1bc64461f93c12aca3ab liutao.0220 <liutao.0220@bytedance.com> 1745313310 +0800	commit: add think dpo loss v2
